Analyzing Parallel Programs with Pin

Authors

  • Moshe Bach
  • Mark Charney
  • Robert Cohn
  • Elena Demikhovsky
  • Tevi Devor
  • Kim Hazelwood
  • Aamer Jaleel
  • Chi-Keung Luk
  • Gail Lyons
  • Harish Patil
Abstract

Published by the IEEE Computer Society, 0018-9162/10/$26.00 © 2010 IEEE.

Instrumentation is one tool for collecting the information needed to understand programs. Instrumentation-based tools typically insert extra code into a program to record events during execution [1-4]. The cost of executing the extra code can be as low as a few cycles, enabling fine-grained observation down to the instruction level. Pin [2] (www.pintool.org) is a software system that performs runtime binary instrumentation of Linux and Microsoft Windows applications. Pin’s aim is to provide an instrumentation platform for building a wide variety of program analysis tools, called pintools. By performing the instrumentation on the binary at runtime, Pin eliminates the need to modify or recompile the application’s source and supports the instrumentation of programs that dynamically generate code.
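The idea of recording events at runtime without touching the program's source can be illustrated with a small analogy. The sketch below does not use Pin or its API: Pin instruments machine-level binaries, whereas this example uses Python's standard `sys.settrace` hook to count executed source lines. The names `run_instrumented` and `sum_to` are hypothetical and exist only for this illustration.

```python
import sys
from collections import Counter


def run_instrumented(func, *args):
    """Run func(*args) while counting executed source lines per function.

    Returns (result, counts), where counts maps a function name to the
    number of 'line' trace events observed inside it. This is only an
    analogy for instruction-level instrumentation, not Pin itself.
    """
    counts = Counter()

    def tracer(frame, event, arg):
        # The "extra code": record one event per executed source line.
        if event == "line":
            counts[frame.f_code.co_name] += 1
        return tracer  # keep tracing inside this frame

    previous = sys.gettrace()
    sys.settrace(tracer)  # attach at runtime; the target is not modified
    try:
        result = func(*args)
    finally:
        sys.settrace(previous)  # restore whatever tracer was active before
    return result, counts


def sum_to(n):
    """Unmodified 'application' code that knows nothing about the tracer."""
    total = 0
    for i in range(n):
        total += i
    return total


result, counts = run_instrumented(sum_to, 5)
print(result, dict(counts))
```

As with the approach described in the abstract, the observed function is neither edited nor recompiled; the recording logic is attached only for the duration of the run.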

Similar resources

Analyzing MPI programs with Intel® VTune™ Amplifier XE and Intel® Inspector XE tools

This document is a supplement to the Intel® Cluster Studio XE and Intel® Parallel Studio XE documentation. It describes how to use the Intel® VTune™ Amplifier XE and Intel® Inspector XE tools for analyzing MPI workloads.


Debugging Multi-Threaded Applications using Pin-Augmented GDB (PGDB)

In contrast to single-threaded applications, debugging multi-threaded applications is complex because of the non-deterministic nature of concurrent programs. Multiple threads in concurrent programs introduce bugs such as data races, deadlocks, and livelocks. Popular debuggers like GNU Debugger (GDB), Intel Debugger (IDB) and Microsoft Visual Studio Debugger (MVSD) typically use static or compile-time i...


On Analyzing the Cost of Communication in Networks

s of Talks: Jean-Loup Baer, On the Performance of Cluster Architectures; Yosi Ben-Asher, The Partitioned PRAM Model, Re-Doing Basic PRAM Algorithms with Limited Resources; Artur Czumaj, Adaptive Allocation Processes; Ralf Diekmann, Load Balancing Strategies for Scientific Computing Applications; Martin Dietzfelbinger, On Analyzing the Cost of Communication in Networks; Pierre Fraigniaud, A General Theory...


Automatic Transformations for Effective Parallel Execution on Intel Many Integrated Core

We demonstrate in this work the potential effectiveness of a source-to-source framework for automatically optimizing a sub-class of affine programs on the Intel Many Integrated Core Architecture. Data locality is achieved through complex and automated loop transformations within the polyhedral framework to enable parallel tiling, and the resulting tiles are processed by an aggressive automatic ...


Phi™ for DNA Sequence Analysis

Genetic information is increasing exponentially, doubling every 18 months. Analyzing this information within a reasonable amount of time requires parallel computing resources. While considerable research has addressed DNA analysis using GPUs, so far not much attention has been paid to the Intel Xeon Phi coprocessor. In this paper we present an algorithm for large-scale DNA analysis that exploit...


Publication date: 2010